ENHANCEMENT OF DISCRIMINATIVE CAPABILITIES OF HMM BASED RECOGNIZER THROUGH MODIFICATION OF VITERBI A - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

نویسنده

Jianming Song

چکیده

The algorithm proposed in this paper integrates the concepts of variable frame rate and discriminative analysis based on Tanimoto ratio to modify the conventional Viterbi algorithm, in such a way that the steady or stationary signal is compressed, while transitional or non-stationary signal is emphasized through the frame-by-frame searching process. The usefulness of each frame is decided entirely within the Viterbi process and needs not to be the same for different models. To evaluate this algorithm, we tested a speech database of 9 highly confusable E-set English letters. With 5 state and 6 mixture components, the conventional HMM baseline system only delivered the recognition accuracy of 73.9%. In the preliminary experiment using the algorithm proposed in this paper, the recognition accuracy was increased to 8:2.5%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MICROSOFT WINDOWS HIGHLY INTELLIGENT SPEECH RECOGNIZER: WHISPER - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Since January 1993, we have been working to refine and extend Sphinx-I1 technologies in order to develop practical speech recognition at Microsoft. The result of that work has been the Whisper (Windows Highly Intelligent Speech Recognizer). Whisper represents significantly improved recognition efficiency, usability, and accuracy, when compared with the Sphinx-I1 system. In addition Whisper offe...

متن کامل

Sequential homotopy-based computation of multiple solutions to nonlinear equations

IEEE Intl. Conf. Acoustics, Speech & Signal Processing (ICASSP) May 1995 Homotopy methods have achieved significant success in solving systems of nonlinear equations for which the number of solutions are known and the homotopy paths are bounded. We present a twostage homotopyprocess which does not require a-priori knowledge of the number of solutions to a system of nonlinear equations. This app...

متن کامل

Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement

171 A. Benyassine and H. Abut, “Mixture excitations and finite-state CELP speech coders,” in Proc. IEEE ICASSP., Mar. 1992, pp. 1-345-1-348. P. Krmn and B. S. Atal, “Strategies for improving the performance of CELP coders at low bit rates,” in Proc. IEEE ICASSP, Apr. 1988, pp. 151-154. P. moon and B. S. Atal, “On the use of pitch predictors with high temporal resolution,” IEEE Truns. Acoust., S...

متن کامل

EXPERIMENTAL EVALUATION OF SEGMENTAL HMMS - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

The aim of the research described in this paper is to overcome important speech-modeling limitations of conventional hidden Markov models (HMMs), by developing a dynamic segmental HMM which models the changing pattern of speech over the duration of some phoneme-type unit. As a first step towards this goal, a static segmental HMM [3] has been implemented and tested, This model reduces the influe...

متن کامل

DSP-BASED MOBILE AND SATELLITE RECEIVERS, FROM ALGORITHM TO IMPLEIMENTATION: A DESIGN COURSE AT AACH - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Profound knowledge of the interaction between algorithms and digital signal processor (DSP) architectures is required to be able to efficiently design complex communications equipment. Whereas both algorithms and architecture find treatment in many courses individually, education focusing on design methodology for DSP implementation is found to be rare. This contribution describes a concept and...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

ENHANCEMENT OF DISCRIMINATIVE CAPABILITIES OF HMM BASED RECOGNIZER THROUGH MODIFICATION OF VITERBI A - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

نویسنده

چکیده

منابع مشابه

MICROSOFT WINDOWS HIGHLY INTELLIGENT SPEECH RECOGNIZER: WHISPER - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Sequential homotopy-based computation of multiple solutions to nonlinear equations

Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement

EXPERIMENTAL EVALUATION OF SEGMENTAL HMMS - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

DSP-BASED MOBILE AND SATELLITE RECEIVERS, FROM ALGORITHM TO IMPLEIMENTATION: A DESIGN COURSE AT AACH - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

عنوان ژورنال:

اشتراک گذاری